Strategy complexity of finite-horizon Markov decision processes and simple stochastic games

Abstract

Markov decision processes (MDPs) and simple stochastic games (SSGs) provide a rich mathematical framework to study many important problems related to probabilistic systems. MDPs and SSGs with finite-horizon objectives, where the goal is to maximize the probability to reach a target state in a given finite time, is a classical and well-studied problem. In this work we consider the strategy complexity of finite-horizon MDPs and SSGs. We show that for all $\epsilon>0$, the natural class of counter-based strategies require at most $\log \log (\frac{1}{\epsilon}) + n+1$ memory states, and memory of size $\Omega(\log \log (\frac{1}{\epsilon}) + n)$ is required. Thus our bounds are asymptotically optimal. We then study the periodic property of optimal strategies, and show a sub-exponential lower bound on the period for optimal strategies.
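
As an illustration of the finite-horizon reachability objective mentioned in the abstract, the following is a minimal sketch of standard backward induction on a toy MDP. It is not taken from the paper: the function name `finite_horizon_reachability`, the dictionary encoding of transitions, and the three-state example are all assumptions made here for illustration. The sketch shows why the best action can depend on the number of steps remaining, which is the kind of information a counter-based strategy keeps track of.

```python
from typing import Dict, List, Set, Tuple

# Hypothetical encoding: trans[state][action] = [(probability, next_state), ...]
Transitions = Dict[int, Dict[str, List[Tuple[float, int]]]]


def finite_horizon_reachability(trans: Transitions, targets: Set[int], horizon: int):
    """Backward induction for max probability of reaching `targets` within `horizon` steps.

    Returns (values, strategy), where values[k][s] is the optimal probability
    with k steps left and strategy[k][s] is a maximizing action in state s.
    """
    states = list(trans)
    # With 0 steps left, only states already in the target set have value 1.
    values = [{s: 1.0 if s in targets else 0.0 for s in states}]
    strategy = [{}]  # no action is chosen when no steps remain
    for _ in range(horizon):
        prev = values[-1]
        v_k, pi_k = {}, {}
        for s in states:
            if s in targets:
                v_k[s] = 1.0  # target already reached
                continue
            best_action, best_value = None, -1.0
            for action, outcomes in trans[s].items():
                q = sum(p * prev[t] for p, t in outcomes)
                if q > best_value:
                    best_action, best_value = action, q
            v_k[s] = max(best_value, 0.0)  # states without actions get value 0
            if best_action is not None:
                pi_k[s] = best_action
        values.append(v_k)
        strategy.append(pi_k)
    return values, strategy


# Toy example (an assumption, not from the paper):
# state 0 = start, state 1 = target, state 2 = absorbing sink.
EXAMPLE: Transitions = {
    0: {
        "safe": [(0.5, 1), (0.5, 0)],   # may retry from state 0 on failure
        "risky": [(0.6, 1), (0.4, 2)],  # higher one-shot chance, but failure is final
    },
    1: {},
    2: {},
}

values, strategy = finite_horizon_reachability(EXAMPLE, {1}, 3)
for k in range(1, 4):
    print(k, round(values[k][0], 3), strategy[k][0])
```

In this toy example the maximizing action in state 0 is "risky" when one step remains and "safe" when more steps remain, so a strategy that ignores the step counter cannot be optimal for every horizon here.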

Bibliographic details

Authors: Chatterjee, Krishnendu; Ibsen-Jensen, Rasmus
Year: 2012
Original format: PDF
Language: English

Similar literature

1. The complexity of analyzing infinite-state Markov chains, Markov decision processes, and stochastic games (Invited talk) [J]. Kousha Etessami. LIPIcs: Leibniz International Proceedings in Informatics. 2013, No. 1
2. Lexicographic refinements in possibilistic decision trees and finite-horizon Markov decision processes [J]. Ben Amor Nahla, El Khalfi Zeineb, Fargier Helene. Fuzzy sets and systems. 2019, No. JULa1
3. Reducible Markov Decision Processes and Stochastic Games [J]. Ning Jie. Production and operations management. 2021, No. 8
4. Strategy Complexity of Finite-Horizon Markov Decision Processes and Simple Stochastic Games [C]. Krishnendu Chatterjee, Rasmus Ibsen-Jensen. International doctoral workshop on mathematical and engineering methods in computer science. 2013
5. Investigation of Computational Reduction Strategies for Markov Decision Processes [D]. Zhai, Jie. 2019
6. Decision Making Under Uncertainty: A Neural Model Based on Partially Observable Markov Decision Processes [O]. Rajesh P. N. Rao. 2010
7. The complexity of analyzing infinite-state Markov chains, Markov decision processes, and stochastic games (Invited talk) [O]. Etessami Kousha. 2013
8. Effect of Practice on Decision Making in Simple Games with Simple Strategies [R]. Payne, W. H. 1965